Mining Web navigation patterns with a path traversal graph

نویسندگان

  • Yao-Te Wang
  • Anthony J. T. Lee
چکیده

With the expansion of e-commerce and mobile-based commerce, the role of web user on World Wide Web has become pivotal enough to warrant studies to further understand the user’s intent, navigation patterns on websites and usage needs. Using web logs on the servers hosting websites, site owners and in turn companies, can extract information to better understand and predict user’s needs, tailoring their sites to meet such needs. The former mining algorithms do not provide a clear picture of the intentions of the visitors and suffer from drawback of either repetitive database scan or high memory load. This paper uses the concept of throughout-surfing patterns(TSPs) and proposes an efficient algorithm for mining the patterns, that effectively predict and display the trends toward the next visited Web pages in a browsing session with a view to better understand the purposes of website visitors. It also uses a compact graph structure, termed a path traversal graph, to record information about the navigation paths of website visitors, required for mining TSPs. In addition, it proposes a new algorithm for graph traversal based on the prior graph structure to discover the TSPs. The experimental results show the proposed algorithm is highly efficient to discover TSPs, by improving the accuracy, reducing the execution time and memory requirements with a single scan on the database & avoiding generation of candidate sequence as like apriori. Keywords— Web Log Mining, Path traversal graph, Throughout-surfing pattern, Browsing behavior, web Traversal pattern ——————————  ——————————

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Review on Path Traversal for Web Navigation Mining

Web Navigation Pattern is point comes under Web Usage Mining which shows how one can visited from one page to another i.e. it shows navigational behaviour. Mostly this pattern mining is success part of ecommerce and mobile commerce. Analysing this data will help the organizations to realize the lifetime value of their clients, and provide them with a more sophisticated structure of the web site...

متن کامل

Efficient Mining Web Navigation Pattern using an Efficient Graph Traverse Algorithm

In the modern world navigational behaviour of website visitor is an important role. In this paper implemented the web navigational pattern for college websites. Traditional method of web usage mining approach gives inefficient result for their web navigational pattern. So to overcome this, in this paper proposed an algortihm through-surfing pattern (TSP) from incremental database for college we...

متن کامل

Web Users Session Analysis Using DBSCAN and Two Phase Utility Mining Algorithms

One of the important issues in data mining is the interestingness problem. Typically, in a data mining process, the number of patterns discovered can easily exceed the capabilities of a human user to identify interesting results. To address this problem, utility measures have been used to reduce the patterns prior to presenting them to the user. A frequent itemset only reflects the statistical ...

متن کامل

Naviz: User Behavior Visualization of Dynamic Page

Navigational behavior of website visitors can be extracted from web access log files with data mining techniques such as sequential pattern mining. Visualization of the discovered patterns is very helpful to understand how visitors navigate over the various pages on the site. Currently several web log visualization tools have been developed. However those tools are far from satisfactory. They d...

متن کامل

Mining Top-K Path Traversal Patterns over Streaming Web Click-Sequences

Online, one-pass mining Web click streams poses some interesting computational issues, such as unbounded length of streaming data, possibly very fast arrival rate, and just one scan over previously arrived Web click-sequences. In this paper, we propose a new, single-pass algorithm, called DSM-TKP (Data Stream Mining for Top-K Path traversal patterns), for mining a set of top-k path traversal pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Expert Syst. Appl.

دوره 38  شماره 

صفحات  -

تاریخ انتشار 2011